Region-Restricted Clustering for Geographic Data Mining
نویسندگان
چکیده
Cluster detection for a set P of n points in geographic situations is usually dependent on land cover or another thematic map layer. This occurs for instance if the points of P can only occur in one land cover type. We extend the definition of clusters to regionrestricted clusters, and give efficient algorithms for exact computation and approximation. The algorithm determines all axis-parallel squares with exactly m out of n points inside, size at most some prespepcified value, and area of a given land cover type at most another prespecified value. The exact algorithm runs in O(nm log n + (nm + nnf ) log 2 nf ) time, where nf is the number of edges that bound the regions with the given land cover type. The approximation algorithm allows the square to be a factor 1 + ε too large, and runs in O(n log n+n/ε +nf log 2 nf +(n log 2 nf )/(mε )) time. We also show how to compute largest clusters and outliers.
منابع مشابه
A clustering approach for mineral potential mapping: A deposit-scale porphyry copper exploration targeting
This work describes a knowledge-guided clustering approach for mineral potential mapping (MPM), by which the optimum number of clusters is derived form a knowledge-driven methodology through a concentration-area (C-A) multifractal analysis. To implement the proposed approach, a case study at the North Narbaghi region in the Saveh, Markazi province of Iran, was investigated to discover porphyry ...
متن کاملInteractive Subspace Clustering for Mining High-Dimensional Spatial Patterns
The unprecedented large size and high dimensionality of existing geographic datasets make complex patterns that potentially lurk in the data hard to find. Spatial data analysis capabilities currently available have not kept up with the need for deriving the full potential of these data. “Traditional spatial analytical techniques cannot easily discover new and unexpected patterns, trends and rel...
متن کاملPreserving Privacy for Interesting Location Pattern Mining from Trajectory Data
One main concern for individuals participating in the data collection of personal location history records (i.e., trajectories) is the disclosure of their location and related information when a user queries for statistical or pattern mining results such as frequent locations derived from these records. In this paper, we investigate how one can achieve the privacy goal that the inclusion of his...
متن کاملData Warehouse Development to Identify Regions with High Rates of Cancer Incidence in México through a Spatial Data Mining Clustering Task
Data warehouses arise in many contexts, such as business, medicine and science, in which the availability of a repository of heterogeneous data sources, integrated and organized under a unified framework facilitates analysis and supports the decision making process. These data repositories increase their scope and application, when used for data mining tasks, which can extract useful knowledge,...
متن کاملAn Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem
Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...
متن کامل